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24 Human Secreted Proteins 
Field of the Invention 

This invention relates to newly identified polynucleotides, polypeptides 
encoded by these polynucleotides, antibodies that bind these polypeptides, uses of 
5 such polynucleotides, polypeptides, and antibodies, and their production. 

Background of the Invention 
Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 
10 organelle, contains different proteins essential for the function of the organelle. The 
cell uses "sorting signals," which are amino acid motifs located within the protein, to 
target proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a 
leader sequence, directs a class of proteins to an organelle called the endoplasmic 
15 reticulum (ER). The ER separates the membrane-bounded proteins from all other 
types of proteins. Once localized to the ER, both groups of proteins can be further 
directed to another organelle called the Golgi apparatus. Here, the Golgi distributes 
the proteins to vesicles, including secretory vesicles, the cell membrane, lysosomes, 
and the other organelles. 
20 Proteins targeted to the ER by a signal sequence can be released into the 

extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the 
extracellular space - a process called exocytosis. Exocytosis can occur constitutively 
or after receipt of a triggering signal. In the latter case, the proteins are stored in 
25 secretory vesicles (or secretory granules) until exocytosis is triggered. Similarly, 
proteins residing on the cell membrane can also be secreted into the extracellular 
space by proteolytic cleavage of a "linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins 
30 include the commercially valuable human insulin, interferon, Factor VIII, human 
growth hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of 
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the pervasive role of secreted proteins in human physiology, a need exists for 
identifying and characterizing novel human secreted proteins and the genes that 
encode them. This knowledge will allow one to detect, to treat, and to prevent 
medical diseases, disorders, and/or conditions by using secreted proteins or the genes 
5 that encode them. 

Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, 

10 antibodies, and recombinant and synthetic methods for producing the polypeptides 
and polynucleotides. Also provided are diagnostic methods for detecting diseases, 
disorders, and/or conditions related to the polypeptides and polynucleotides, and 
therapeutic methods for treating such diseases, disorders, and/or conditions. The 
invention further relates to screening methods for identifying binding partners of the 

15 polypeptides. 

Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 

20 terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 

25 contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. The 
term "isolated" does not refer to genomic or cDNA libraries, whole cell total or 
mRNA preparations, genomic DNA preparations (including those separated by 
electrophoresis and transferred onto blots), sheared whole cell genomic DNA 

30 preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide/sequences of the present invention. 
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In the present invention, a "secreted" protein refers to those proteins capable 
of being directed to the ER, secretory vesicles, or the extracellular space as a result of 
a signal sequence, as well as those proteins released into the extracellular space 
5 without necessarily containing a signal sequence. If the secreted protein is released 
into the extracellular space, the secreted protein can undergo extracellular processing 
to produce a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 

In specific embodiments, the polynucleotides of the invention are at least 15, 

10 at least 30, at least 50, at least 100, at least 125, at least 500, or at least 1000 

continuous nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 
kb, 10 kb, 7.5 kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment, 
polynucleotides of the invention comprise a portion of the coding sequences, as 
disclosed herein, but do not comprise all or a portion of any intron. In another 

15 embodiment, the polynucleotides comprising coding sequences do not contain coding 
sequences of a genomic flanking gene (i.e., 5* or 3' to the gene of interest in the 
genome). In other embodiments, the polynucleotides of the invention do not contain 
the coding sequence of more than 1000, 500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 
1 genomic flanking gene(s). 

20 As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 

sequence contained in SEQ ID NO:X or the cDNA contained within the clone 
deposited with the ATCC. For example, the polynucleotide can contain the 
nucleotide sequence of the full length cDNA sequence, including the 5' and 3' 
untranslated sequences, the coding region, with or without the signal sequence, the 

25 secreted protein coding region, as well as fragments, epitopes, domains, and variants 
of the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 
molecule having the translated amino acid sequence generated from the 
polynucleotide as broadly defined. 

In the present invention, the full length sequence identified as SEQ ED NO:X 

30 was often generated by overlapping sequences contained in multiple clones (contig 
analysis). A representative clone containing all or most of the sequence.for SEQ ID 
NO:X was deposited with the American Type Culture Collection ("ATCC"). As 
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shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard, 
Manassas, Virginia 201 10-2209, USA. The ATCC deposit was made pursuant to the 
terms of the Budapest Treaty on the international recognition of the deposit of 
5 microorganisms for purposes of patent procedure. 

A "polynucleotide" of the present invention also includes those 
polynucleotides capable of hybridizing, under stringent hybridization conditions, to 
sequences contained in SEQ ID NO:X, the complement thereof, or the cDNA within 
the clone deposited with the ATCC. "Stringent hybridization conditions" refers to an 

1 0 overnight incubation at 42 degree C in a solution comprising 50% formamide, 5x SSC 
(750 mM NaCl, 75 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x 
Denhardt's solution, 10% dextran sulfate, and 20 ng/ml denatured, sheared salmon 
sperm DNA, followed by washing the filters in 0.1 x SSC at about 65 degree C 
Also contemplated are nucleic acid molecules that hybridize to the 

1 5 polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of formamide concentration (lower 
percentages of formamide result in lowered stringency); salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 

20 incubation at 37 degree C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 
0.2M NaH 2 P0 4 ; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml 
salmon sperm blocking DNA; followed by washes at 50 degree C with 1XSSPE, 
0.1% SDS. In addition, to achieve even lower stringency, washes performed 
following stringent hybridization can be done at higher salt concentrations (e.g. 5X 

25 SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
30 commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, 
due to problems with compatibility. 
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Fusion proteins having disulfide-linked dimeric structures (due to the IgG) can also be 
more efficient in binding and neutralizing other molecules, than the monomeric 
secreted protein or protein fragment alone. (Fountoulakis et al., J. Biochem. 
270:3958-3964 (1995).) Polynucleotides comprising or alternatively consisting of 
5 nucleic acids which encode these fusion proteins are also encompassed by the 
invention. 

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion 
proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a 
10 fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively, 
deleting the Fc part after the fusion protein has been expressed, detected, and purified, 
would be desired. For example, the Fc portion may hinder therapy and diagnosis if 
the fusion protein is used as an antigen for immunizations. In drug discovery, for 

15 example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hEL-5. (See, 
D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol. 
Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 

20 sequences, such as a peptide which facilitates purification of the fused polypeptide. 
In preferred embodiments, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 9131 1), among others, many of which are commercially available. 
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 

25 instance, hexa-histidine provides for convenient purification of the fusion protein. 
Another peptide tag useful for purification, the M HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 
(1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
30 or the polypeptides of the present invention. 

Vectors^ Host Cells, and Protein Production 
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The present invention also relates to vectors containing the polynucleotide of 
the present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
vector. Retroviral vectors may be replication competent or replication defective. In 
5 the latter case, viral propagation generally will occur only in complementing host 
cells. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
10 vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 
and then transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac 
promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to 
15 name a few. Other suitable promoters will be known to the skilled artisan. The 

expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
20 UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G41 8 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin 
resistance genes for culturing in E. coli and other bacteria. Representative examples 
25 of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells 
(e.g., Saccharomyces cerevisiae or Pichia pastoris (ATCC Accession No. 201 178)); 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture 
30 mediums and conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE- 
9, available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
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pNH16a, pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, 
. Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available 
5 from Pharmacia. Preferred expression vectors for use in yeast systems include, but are 
not limited to pYES2, pYDl, pTEFl/Zeo, pYES2/GS, pPICZ,pGAPZ, pGAPZalph, 
pPIC9, pPIC3.5, pHIL-D2, pHII^Sl, pPIC3.5K, pPIC9K, and PA0815 (all available 
from Invitrogen, Carlbad, CA). Other suitable vectors will be readily apparent to the 
skilled artisan. 

10 Introduction of the construct into the host cell can be effected by calcium 

phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al., Basic 
Methods In Molecular Biology (1986). It is specifically contemplated that the 

15 polypeptides of the present invention may in fact be expressed by a host cell lacking a 
recombinantvector. 

A polypeptide of this invention can be recovered and purified from 
recombinant cell cultures by well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 

20 phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

Polypeptides of the present invention, and preferably the secreted form, can 
25 also be recovered from: products purified from natural sources, including bodily 
fluids, tissues and cells, whether directly isolated or cultured; products of chemical 
synthetic procedures; and products produced by recombinant techniques from a 
prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, 
insect, and mammalian cells. Depending upon the host employed in a recombinant 
30 production procedure, the polypeptides of the present invention may be glycosylated 
or may be non-glycosylated. In addition, polypeptides of the invention may also 
include an initial modified methionine residue, in some cases as a result of host- 
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mediated processes. Thus, it is well known in the art that the N-terminal methionine 
encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-terminal 
methionine on most proteins also is efficiently removed in most prokaryotes, for some 
5 proteins, this prokaryotic removal process is inefficient, depending on the nature of 
the amino acid to which the N-terminal methionine is covalently linked. 

In one embodiment, the yeast Pichia pastoris is used to express the 
polypeptide of the present invention in a eukaryotic system. Pichia pastoris is a 
methylotrophic yeast which can metabolize methanol as its sole carbon source. A 
1 o main step in the methanol raetabolization pathway is the oxidation of methanol to 
formaldehyde using 0 2 . This reaction is catalyzed by the enzyme alcohol oxidase. In 
order to metabolize methanol as its sole carbon source, Pichia pastoris must generate 
high levels of alcohol oxidase due, in part, to the relatively low affinity of alcohol 
oxidase for Oj. Consequently, in a growth medium depending on methanol as a main 
15 carbon source, the promoter region of one of the two alcohol oxidase genes {AOX1) is 
highly active. In the presence of methanol, alcohol oxidase produced from the AOX1 
gene comprises up to approximately 30% of the total soluble protein in Pichia 
pastoris. See, Ellis, S.B., et al, Mol. Cell. Biol. 5:1 1 1 1-21 (1985); Koutz, PJ, et al, 
Yeast 5:167-77 (1989); Tschopp, J.F., et al. Nucl. Acids Res. 15:3859-76 (1987). 
20 Thus, a heterologous coding sequence, such as, for example, a polynucleotide of the 
present invention, under the transcriptional regulation of all or part of the ,4.0*7 
regulatory sequence is expressed at exceptionally high levels in Pichia yeast grown in 
the presence of methanol. 

In one example, the plasmid vector pPIC9K is used to express DNA encoding 
25 a Polypeptide of the invention, as set forth herein, in a Pichea yeast system essentially 
as described in "Pichia Protocols: Methods in Molecular Biology," D.R. Higgins and 
J. Cregg, eds. The.Humana Press, Totowa, NJ, 1998. This expression vector allows 
expression and secretion of a protein of the invention by virtue of the strongy40A7 
promoter linked to the Pichia pastoris alkaline phosphatase (PHO) secretory signal 
30 peptide (i.e., leader) located upstream of a multiple cloning site. 
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Many other yeast vectors could be used in place of pPIC9K, such as, pYES2, 
pYDl, pTEFl/Zeo, pYES2/GS, pPICZ, pGAPZ, pGAPZalpha, pPIC9, pPIC3.5, 
pHIL-D2, pHIL-Sl, pPIC3.5K, and PA0815, as one skilled in the art would readily 
appreciate, as long as the proposed expression construct provides appropriately 
5 located signals for transcription, translation, secretion (if desired), and the like, 
including an in-frame AUG as required. 

In another embodiment, high-level expression of a heterologous coding 
sequence, such as, for example, a polynucleotide of the present invention, maybe 
achieved by cloning the heterologous polynucleotide of the invention into an 

10 expression vector such as, for example, pGAPZ or pGAPZalpha, and growing the 
yeast culture in the absence of methanol 

In addition to encompassing host cells containing the vector constructs 
discussed herein, the invention also encompasses primary, secondary, and 
immortalized host cells of vertebrate origin, particularly mammalian origin, that have 

15 been engineered to delete or replace endogenous genetic material (e.g., coding 
sequence), and/or to include genetic material (e.g., heterologous polynucleotide 
sequences) that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, 
techniques known in the art may be used to operably associate heterologous control 

20 regions (e.g., promoter and/or enhancer) and endogenous polynucleotide sequences 
via homologous recombination, resulting in the formation of a new transcription unit 
(see, e.g., U.S. Patent No. 5,641,670, issued June 24, 1997; U.S. Patent No. 
5,733,761, issued March 31, 1998; International Publication No. WO 96/29411, 
published September 26, 1996; International Publication No. WO 94/12650, 

25 published August 4, 1994; Koller et al., Proc. Natl. Acad. Sci. USA 86:8932-8935 
(1989); and Zijlstra et al., Nature 342:435-438 (1989), the disclosures of each of 
which are incorporated by reference in their entireties). 

In addition, polypeptides of the invention can be chemically synthesized using 
techniques known in the art (e.g., see Creighton, 1983, Proteins: Structures and 

30 Molecular Principles, W.H. Freeman & Co., N.Y., and Hunkapiller et al., Nature, 
310:105-111 (1984)). For example, a polypeptide corresponding to a fragment of a 
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ctgagtgtga ccacgctcag cctcttgctc gccccggtgc tgtggagagc tgcaatcacg 
aggtgtgtgc ccagaccgga gagacggtcc agcctctgat ggctcggaga tgatggaccg 
tggaagggaa gcgtctgtgg ggagtgagcg cttagatggc cagcagctgc tccttctggg 
aagctcgcac cttggcaaca gaacagccct ctagcagagc gtcagtgcag tcgtgttatc 
ccggctttta cagaatattc ttgtcctatt ttagaatttt ccggagtagt ttatttgcag 
tctgttgatt atgtgcagta gacccgggac actgcgtttt accgatcacc ttgaatgtgg 
tgcctggatg tgcctttttt ttttttccct gaaattatta ttaattttct attgtgagtt 
catcagttca tagttttttt agtaaagaag caaaattaaa aggcttttaa aaatgtacaa 
cttcagaatt ataatctgtt agtcaaatat ttgttattaa acatttctgt aatatgaagt 
tgtaatcctg gccgtgagct tggaagctta cttttgattc ttaaagccta tgttttctaa 
aatgagacaa atacggatgt ctatttgcct tttattgtaa cttttaaatg aaataatttc 
atgtcaattt ctattagata tatcacttaa aatatttggt tttaaatcac aagaatatgt 
attctttaat aaagataatt tatgatcatg gtataattaa ttgaaattta ttaaaatctg 
tttttattaa aaaaaaaaaa aaaaaaactc gagggggggc ccggtaccca attcgcccta 
ggaa 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1144 



<210> 59 
<211> 1120 
<212> DNA 

<213> Homo sapiens 
<400> 59 

ggaggagaac gccacctcca tcgaacccat ccgcgacttc ctggccatcg ttttcttcgc 60 

ctccataggg ctccacgtgt tccccacgtt tgtggcgtac gagctcacgg tgctggtgtt 120 

cctcaccttg tcagtggtgg tgatgaagtt tctcctggcg gcgctggtcc tgtctctcat 180 

tctgccgagg agcagccagt acatcaagtg gatcgtctct gcggggcttg cccaggtcag 240 

cgagttttcc tttgtcctgg ggagccgggc gcgaagagcg ggcgtcatct ctcgggaggt 300 

gtacctcctt atactgagtg tgaccacgct cagcctcttg ctcgccccgg tgctgtggag 360 

agctgcaatc acgaggtgtg tgcccagacc ggagagacgg tccagcctct gatggctcgg 420 

agatgatgga ccgtggaagg gaagcgtctg tggggagtga gcgcttagat ggccagcagc 480 

tgctccttct gggaagctcg caccttggca acagaacagc cctctagcag agcgtcagtg 540 

cagtcgtgtt atcccggctt ttacagaata ttcttgtcct attttagaat tttccggagt 600 

agtttatttg cagtctgttg attatgtgca gtagacccgg gacactgcgt tttaccgatc 660 

accttgaatg tggtgcctgg atgtgccttt tttttttttc cctgaaatta ttattaattt 720 

tctattgtga gttcatcagt tcatagtttt tttagtaaag aagcaaaatt aaaaggcttt 780 

taaaaatgta caacttcaga attataatct gttagtcaaa tatttgttat taaacatttc 840 

tgtaatatga agttgtaatc ctggccgtga gcttggaagc ttacttttga ttcttaaagc 900 

ctatgttttc taaaatgaga caaatacgga tgtctatttg ccttttattg taacttttaa 960 

atgaaataat ttcatgtcaa tttctattag atatatcact taaaatattt ggttttaaat 1020 

cacaagaata tgtattcttt aataaagata atttatgatc atggtataat taattgaaat 1080 

ttattaaaat ctgtttttat taaaaaaaaa aaaaaaaaaa 1120 



<210> 60 
<211> 1137 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1112) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1131) 

<223> n equals a,t,g, or c 



